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(1.) Introduetorij, On a Generalised Theorij of Alternative Inheritance, tvith special 

Reference to Misndei/s Laws/^ 

It seeiiks likely to be of Interest at the present time to consider rather at length a 
fairly full mathematical theory of the pure gamete. We do not venture to call this 
theory a generalised Mendelian theory of inheritance, partly because it is not even the 
most general theory of the pure gamete conceivable, partly because Mendel's original 
theory of heredity was perfectly clear and perfectly simple, and is not the theory here: 
developed. The pure and simple MendeHan theory seems to have been discarded in 
the light of recent experimental results by more than one Mendelian, both in this 
country and abroad. The original Mendelian theory has been replaced by what are 
termed /' Mendelian Principles." In this aspect of investigation the fundamental 
principles propounded by Mendel are given up, and for each individual case a pure 
gamete formula of one kind or another is suggested as describing the facts, f This 
formula is then emphasised, modified or discarded, according as it fits well, badly, or 
not at all with the growing inass of experimental data. 

It is quite clear that it is impossible while this process is going on to term 
anything whatever Mendelian as far as theory is concerned. The present investigation 
is therefore not a generalised Mendelian theory of heredity : we speak of it merely 
as a generalised theory of alternative inheritance, and it is based on the conception 
that the gamete remains pure, and that the gametes of two groups, while they may 
link up to form a complete zygote, do not thereby absolutely fuse and lose their 

* I owe the incentive to this memoir to Professor W. F. R. Weldon, who had already worked at some 
of the simpler special cases and who placed his results entirely at my disposal. 

t See especially Tschermak, ' Zeitschrift f. d. landwirthsch. Versuchswesen,' Jahrg. IV. {'' Ueber 
Ziichtung neuer Getreiderassen ") ; I)e Vries, ' Ber. d. deutsch, botan. Gesellsch.,' vol. xviii. (1900), 
pp. 435-443; Bateson, ' Proc. Camb. Phil. Soc./ vol. 12, p. 53; 'Nature,' April and May, 1903. 
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identity. The analytical expression of this is represented by the fundamental 
formula : 



(A + A') {a + a) 



A 



J 



Aa 

■■ * » > » ♦ \ ■■ / 5 



where (AA^) and (aa^) are the parental zygotes, and the right-hand side of the 
equation represents the four possible constitutions of the offspring. Such a formula 
as the above may be accepted without any hypothesis as to dominant and recessive 
characters, but these terms were certainly essential to Mendelian theory as propounded 
by Mendel himself, and it becomes very doubtful whether we ought to attach his 
name to any theory which discards these '' recognition marks/' It is very convenient, 
however, to have names for the alternative elements expressed by capital and small 
letters respectively. I propose for the purpose of this paper to term an A~element a 
protogeiie, and an a-element an allogene. Two protogenic elements will give rise to a 
protogenic zygote AA, two allogenic elements to an allogenic zygote aa, and a 
protogenic and allogenic element to vvdiat Mr. Bateson has termed a heterozygote 
Aa. We may thus class his homozygotes into protozygotes and allozygotes. We 
reach pure Mendelianism by making our protozygotes *' dominants," our allozygotes, 
'' recessives,'' and our hetei^ozygotes '' hybrids of dominant character." In so far as 
our theory of pure gametes replaces protozygote, allozygote, and heterozygote by 
'' dominant," '' recessive," and '' hybrid with dominant character," it becomes a 
generalised Mendelian theory, but only in this case. Otherwise we must look upon 
it as an attempt — in one direction only of course — to give a consistent mathematical 
basis to the various formulae which have been propounded for describing statistical 
data classed under Mendelian cates:ories : shortlv we shall endeavour to develoD a 
general pure gamete theory. 

The results were worked out in a purely impartial frame of mind ; indeed, once 
state the hypotheses, and the analysis is far too complex to allow us to predict d 
'priori what can possibly result from it, nor does the investigation admit of any but 
one solution. If the hypotheses are admissible, then any narrower pure gamete 
formula must lead to results embraced under our general conclusions. 

What we have to admit at the present time are the following conditions : — 

(i.) The existence of a vast bulk of evidence that heredity, as far as measurable 
characters are concerned, follows within a population perfectly definite laws. 

(ii.) The existence of another mass of experiments, in which simple and pure 
Mendelianism is certainly inadmissible, but in which certain ratios undoubtedly 
approach the values they woidd have on such a simple and pure Mendelian theory. 

It is possible, therefore, that a generalised theory of the pure gamete would account 
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for (ii.) ; it can only do so satisfactorily, however, if it does not contradict the results 
of (i.). Hence arises the present attempt to develop in one direction a generalised 
theory of the action of the gerni-cell. 

As we have frequently had to assert, the laws investigated inider (i) have nothing 
w^hatever to do with any physiological hypothesis. That a pliysiological hypothesis 
leads to them is not nuich test of its validity— it is a necessary, bnt not stifficien% 
criterion of its correctness. If, however, it contradicts them, we are bound to 
discard it, and seek for its modification or replacement. , The present study is an 
attempt to see how fer one generalised pure gamete theory leads to results in 
accordance with the law of regression and the known nature of the distributions of 
offspring in populations. 

(2.) Nature of Hypothesis adopted. ^ 

We start with a zygote consisting not of a single protogenic pair AA, but built 
up of n such pairs, 

We suppose this to produce gametes which unite with those of a similar allogenic 

zygote 

Any element of the protogenic s^amete must unite with the corresponding^ element 
of the allogenic gamete, i,e,^ Ky with ary and by the fundamental principle (i.) above^ 
this gives rise to the four possibilities 

A-f-ar , 

wJiich are all of the same constitution. The result is the hybrid group, symbolised by 

- C^|A| -j- C^oAc2, "4" ^^^Ag -f" ♦ • » "4" 0^u^n$ 

the perfect multiple heterozygote. 

The population will now be supposed to consist of any number of such perfect 
heterozygotes, which we shall suppose to again cross* We shall now have 



IjrK X a,Ar = 


^- atU'r 




Cl'rKr 




j\.faf 



V. J:\.fl\.f 
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oiyeach couplet will give rise to four possibilities, representing, however, only three 
constitutional differences, expressed by 

Since these four possibilities may occur with each of the n couplets, we shall iiave 
wlieir two perfect heterozygotes cross, d^**' resulting possibilities. These form the 
resulting population of the second generation. Oui' fh'st problem will be to hnd tlie 
distribution of this population. This, according to Mendel, is the segregating 
generation. We must inquire into the frequency of each constitutional difference in 
this segregating generation. 

We now reacli our second limiting hypotliesis, wliich is needful if we are to ^*l)ply 
our theory to sexual reproduction. We suppose : — 

There to be an absence of homogamy (including self-iertihsation), and the meml)ers 
of the second generation to cross absolutely at random and with equal fertility. ^^ 
We have then to ask what is tlie distribution of constitutional differences in tlie 
third generation. Does the process of segregation begun in the second generation 
continue to the third, or does the population now remain stable ? Is the coriimual 
segregation into pure protogenic and allogenic individuals a necessary result of any 
pure gamete theory, or does the belief in such necessity depend upon tlie first 
Mendelian experimenters working only with self- fertilising individuals ? 



(8.) Problem 1.— To find the Distribntion of the Offspring of the Fe7feot 

Heterozygotes. 

We shall here use a symbolic form of analysis. Let 'ti stand for aa, v for a A. and 
w for AA ; then any corresponding couplets will give rise to 

■?/. + 2i) + u', 

and any one of these constitutions may be associated with one of the similar 
constitutions in any of the remaining n — 1 couplets. Hence the general distributioi 
of the population will be given by the terms of the inultinomial 



1 



(u + 2v + 'iv'Y\ 
This equals t 

11^^ 4- n'U"-^^ {2v + tv) + ^^-i^-^^^^ ^-^ t6"-- (2t) + wf + . . . + c.,^ .,, , nf'-^' {2v + ivY + .., 

■^" If one is to study heredity in populations with a view to the prol)lem of evolution, the conditions as 
to fertilisation should approach as far as possible the conditions we suppose them to l^e under in a 
natural state; we must fix our attention on the 'inass relations between successive generations of the 
population. 

t Throughout this memoir the symbol Cn, p, q ^^ "^i^ed for the expression \n / { i^^jfj^- ^£ \p Ig}- 
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Thus, for example, there would be out of the total population of possibilities 4'' : 
1 purely allogenic individual, n X S individuals with ti •— 1 allogenic couplets ; 2n of 
these would have one heterogenic couplet, and n would have one protogenic 
couplet. 

Generally there would be 3* c,,^ ,^ ^ individuals with n — s allogenic couplets, and 
these individuals would be distributed according to the terms of the binomial 

{2v + ivy. 

We are thus able to write down at once the number of any class of individual that 
can appear in the segregating generation. For example, how often do individuals 
like u'''"i''~HP'Wi appear, ^.e., individuals with n — - p — <^ allogenic, p heterogenic, and 
([ protogenic couplets ? 

To answer this problem all we have to do is to pick out the coefficient of u'^'^^'^'^vHv'i 
in the above multinomial, and the result is 

We are thus fully able to predict how many individuals of each kind ought to 
occur when a population of perfect n-couplet heterozygotes are crossed. 

Corollary (i.).-™~Let us consider only the number of allogenic couplets in thd 
distribution of the segregating generation. If we were '' pure Mendelians '' we should 
for the purpose of character classification make v = %v, as the heterogenic couplet 
would then give the dominant character* But without doing this we can assume 
V and ID to be non-iiB. 

Hence the distribution of allogenic characters in the population follows the simple 
binomial 

Afi (1 Jl. ^\n 
^ U ^ 4/ • 

Thus we see that the distribution would be a skew binomial closely approximating 
to my skew curve of Type IIL,^^' and becoming indefinitely close to a normal 
distribution of the form 

when the number of couplets, n, is indefinitely increased. 

For any value of n the mean of the skew binomial as measured by the formula ol 
the memoir oil '' Skew Variation,''! 

= ^ 4- 1 — (1 + fn) == ftt^ 



Md the standard deviation = s/'fyn^. 

Thus the mean number of allogenic couplets in the members of the segregating 
generation is |- of the total number of couplets. 



* ^Phil. Trails./ A, Yd, 186, p. 373. 
t /^^V/., p. 



toL dcnh-— A. I 
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Corollary (ii.).— The distribution of heterogenic couplets in the segregating 
generation is given by the symmetrical binomial 

4« (I + D". 

The mean number is therefore ^n^ and the standard deviation ^^^ v^|-n. 

This is a symmetrical binomial, and approaches extremely closely, even for a fairly 
small value of n, to the normal curve. We see that if any character depends solely 
upon heterogenic couplets, the distribution will be nearly normal, and the variability 
slightly greater than one depending on the allogenic (or, of course, the protogenic) 
couplets only. 

To sum up, then, so far as the distribution of characters depending upon allogenic 
or heterogenic couplets goes, we may say that a generalised theory of the pure 
gamete leads us to those norma.1 and skew distributions of frequency with which 
biometric studies of variability have made us already familiar. It would not be 
possible to base a crucial experiment on the existence or non-existence of such 
frequency distributions. The generalised pure gamete theory would, however, 
account for their appearance, which, of course, a purely descriptive statistical theory 
cannot do. On the other hand, distributions diverging much beyond the errors of 
random sampling from binomials of the above types would tell pro tatno against the 
pure gamete theory in its above form. The presence of binomials of two types only^ 
(i + -ky ^^d (^ + f )''? ought to be capable of detection, even if it would not already 
have been discovered, had it been the rule. 



(4.) Pboposition II. — To determine the Distribution of the Offspring of the 
Segregating Generation, supposing them to Mate at Random and 'without 
Differential Fertility, 

The solution of this problem may be reached as follows :— 

Suppose P any male, and Q any female, say each of n — 1 couplets, producing an 
array of offspring, which we will denote by R ; now suppose an additional couplet, the 
n}^, added to both male and female zygote. The male may be now : 

P + a,Ao or P + anKi, or P + Kt^m or P + KAn ; 
and the female may be 

Q + otn<^n, or Q + anKi, OT Q + A;,a,„ or Q + A,,A^ ; 

that is, we get 4X4 new mating individuals, with 4X4x4 new offspring 
possibilities. 

Now consider the first father P + a^a,i ; the possibilities which arise from mating 
him with the four mothers are the array R of offspring combined with any one of the 
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16 possibilities 8a;^a,, + 8a,^^A^^, or this is the same thing as multiplying the R array 
by the symbolic factor 8 (it -{- v) = 16U, say. 

The next pair of fathers 2 (P + <^«A«) with the four mothers reproduce the array R 
of offspring combined with 8 (a^.a^^ + Sa^^A^^ + A^^^A^,), or 32 possibilities. But this is 
the same as multiplying the R-array by the symbolic factor 8 (u -^ 2v + iv) = 32V, 
say. Lastly, the P + A^^A^^ father with the four mothers gives 16 possibilities of the 
form Sa^An + 8 A^^A;^ to be combined with the R-array of offspring, which is the same 
thing as multiplying the R-array by the symbolic factor 8 (v + tv) = 16W, say. 

We have at once the symbolic relation among the operators ; 

XJ -|- 2V + W = If + 2t' + tv I 

and, further, the important result that the array of offspring due to any pair P and Q 

of n — 1 -couplet parents can be converted into the arrays of offspring due to the 16 
pairs of |)arents formed by adding an additional couplet to P and Q, by multiplying 
that array by the symbolic factor 

16IJ + 32V + 16W = 16 (21 + 2v + tv). 

We have thus by induction a means of finding the array of offspring due to a 
population of parents of n couplets from the series of arrays due to a population of 
n — - 1 couplets. Since all the arrays are to be multiplied by the same symbolic 
factor, we can multiply their total by this factor. Or the distribution of offspring of 
(n — l)-couplet parents being J, that of n~couplet parents 

:= 16 (u -^ 2v -{• w) J = A X 4: X 4: , {^u + f'?^ +■ i^) J. 

Now consider parents of one couplet, their distribution is given, by aa + 2aA + AA, 
and they are to mate with the same series, aa + 2aA + -^A. 
But 

aa X aa = 4aa, 

2 {aa X 2aA) = 2 (4aa -f 4ak), 

2 (aa X AA) = 2 (4aA), 

2aA X 2aA = 4aa + 8aA + 4AA, 

2 (2aA X AA) = 2 (4aA + 4AA), 

AA X AA ™ 4AA, 

Total = 16aa + 32aA + 16AA. 

= 16 (it + 2v + iv) = 4X4X4 (^u + ^v + ^iv) 
symbolically. 

I 2 



60 PEOFESSOR K, PEAESON ON A GENERALISED THEORY OP ALTERNATIVE 

Hence, by the above proposition, the clistribntion of offspring of parents of two 
couplets is 

4 X 4 X 4 . (|t^ + \v + \id) X 4 X 4 X 4 . {\u + ^ + ^^iv) 

=:= 4? X 42 X 4^ {lu + Iv + livf, 

and, by induction^ the distribution of offspring for the random mating of parents of 

n couplets is 

4«. y^ 4« ^ 4. ^ (i^^^ j^ 1^ ^ i^^y.^ 

This, except for the constant factor 4'' X 4'', is absolutely identical with the 
distribution of the parental population, and accordingly if the next generation also 
mates at random, the mixed race will continue to reproduce itself without change. 
We therefore reach the following result :^ — 

However many coiqjlets ive suppose the character under investigation to de2^end 
itpon, the offspring of the hybrids— -or the segregating generation— if they breed at 
random inter se, tvill not segregate further, but continue to reproduce themselves in 
the same proportions as a stable population. 

It is thus clear that the apparent want of stability in a Mendelian population, the 
continued segregation and ultimate disappearance of the heterozygotes, is solely a 
result of self- fertilisation ; with random cross fertilisation there is no disappearance 
of any class whatever in the offspring of the hybrids, but each class continues to be 
reproduced in the same proportions. Thus our generalised theory lends no countenance 
to the appearance of any *' mutations" within a hybrid population under random 
mating; the only appearance of new constitutions is in the segregating generation, or 
the first generation of hybrid offspring. Except at this stage, the appearance of the 
unfamiliar is only the chance occurrence of a very rare normal variation. When we 
recollect that a purely allogenic individual is only to be expected once in a population 
of 4'' individuals, or if there be ten couplets, once in more than a million individuals, 
it will be clePorly seen that the rarity of some of the more exceptional normal 
constitutions may easily lead to their being looked upon as ''mutations," even if they 
appear in the offspring of a population many generations removed from hybridisation. 

(5.) Proposition 111 —-To find the Array of Offspjring due to a Parent of given 

Gametic Constitution mating at Random. 

This can be again deduced by the method of induction adopted in the last 
proposition. 

Supposing a male P of n — 1 couplets to mate with all possible females, and V\^n~~i 
to be the array of offspring, then we have seen in the last proposition that if we add 
an n^^^ couplet a,,a,, to P, the array of offspring due to P + a,,ari will be 16UP,,_| ; if 
we add a couplet a,An, the array of offspring due to fi^thers of type P + %A^^ will be 
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16VR;j„i, ^'^^ ^f ^^^^ ^^^ ^ couplet of form A^^A;^^ the array will be of the form SWR^^^. 
Now start with a father of one couplet ; this must be a^^a^, or ax^], or A^Aj, or in our 
symbolic notation it, v, or w ; the offspring array are respectively 8a, a^ + Sa^A^ or 
4%aj + SajA^ + 4AjAi, or Sa^A^ + SA^Ai, i.e,, 1611, 16V, or 16W. These, therefore, 
are the possible values of Ep Hence, by the principle just developed above, the 
array of offspring due to a father of type 

9 

IS 

(16Uy-^-^(16V)/'(16W)^ 

or remembering that such fathers occur with a frequency of 2^'c,^^^ ^, we have for the 
total distribution of offspring of all fathers of type 

the symbolic result 

Substituting, the following expression would give all offspring of fathers of the 
type u^'^P^HHv'^^ ie., with n — P -^ q allogenic, p heterogenic, and q protogenic 
couplets 

Therefore, given n and given p and q, it is merely a matter of expansion to find the 
array of offspring due to any special class of father. 

Corollary (i.),— So far we have supposed om:' special class of father to be defined by 
the exact couplet distribution constitutional to him,. But it is of interest to consider 
the array of offspring we get supposing only the allogenic couplets fixed in number, 
for example, in a generalised Mendelian theory if the number of recessive couplets be 
fixed, but the heterogenic and dominant, as both exhibiting dominant characters, be 
considered as indifferent. Let s = number of allogenic couplets, then we have to 
sum all arrays like 

subject to the condition that p -|- g = n — ,9. 
The result is clearly 

= 4" X 4" . c„, ,,„U^ (2V + W)"-' 

= 4« X 4" . c„, ,, , i^u + l-v)' ("J« + 1^^ + w)"-' 

= 4" X 4" . c„, , „ O + ivy {(l-w + Iv) + {v + 10)}"-^ 

This, we note, is not a pure binomial, or the arrays of offspring of a father with a 
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given allogenic constitution are not either symmetrical or skew binomials, but of a 
much more complex character. The only exception is the array of offspring of pure 
allogenic fathers^"^^ ¥/hich is given by 

4" X 4^'- X {h^ + i~vy\ ■ 

This is a symmetrical binomial This result is, of course, of special interest, for it 
gives us the distribution of offspring if the hybrid offspring were at any time crossed 
with the pure allogenic race, which was one of the original factors of the hybridisation. 
The deviation from binomial distribution in the above arrays ought to be further 
considered, for if this deviation should turn out to be very significant, it would form a 
convenient test for any generalised theory of pure gametes. 

Corollary (ii.). — -If we sum the above expressions for the array of offspring of all 
fathers of p allogenic couplets for values of s from o to n^ we have the total offspring 
population 

= 4:'' X 4^^ . ta,^ ,, ,IJ^ (2V + W)""^^^ 

— 4:'' X 4« X (U + 2V + W)^^ 
= 4« X 4'^ X {u + 2v + tvf, 

a result we have already found in Proposition II„ as giving the distribution of the 
total offspring population. 



(6.) Proposition IV.— Jb Jlnd the Mean Number of Allogenic Couplets in the 
Offspring of all Fathers having in their Constitution s-allogenic Couplets, 

By the first corollary to the last proposition the distribution of such offspring is 
given by 

4» X 4« . a„, , „ (l-tt + ^ [^u + it; + 2^}"-, 

where tj is written for J {v + w)^ a quantity which is unity so long as we consider not 
the distribution, but the total number of the non-allogenic couplets. Now this is 
clearly the sum of a number of symmetrical binomials in ^u + ^v^ and may be put 



'i,--n-~2 



= 4» X 4» . t c,,,,, (1« + W~' (2^)'. 



^ — o 



Now the means of each of these binomials can be found from the general theory of 

the binomial.! If we take our origin at n + 1 allogenic couplets, with a frequency 
zero, the mean of the first binomial, or 

■^ Or, of course, the array of sons from pure protogenic fathers, 
t 'Phil. Trans.,' A^ vol 186, p. 373, 
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(^u + ^vY is at 1 + h'^^ and its total frequency /^ == 4'' X 4'' X c,,^ ,^ ^ ; 
the mean of the second binomial, or 

the mean of the third binomial, or 

the mean of the {i + 1)*^' binomial 

The total frequency is accordingly 



n-"o 



Hence if m^ be the distance from the same origin of the mean of the above 
system of binomials 

/i X 3"- X m, =/i(l + ^) +fJ2 + ''-r-lj +... + y:.,^(|{ + 1 + !^-=JJ + ... 



/, { 1+ ^- + 2 (n - .) (2 +."~^1) + 2^ L^-ilL^ -..^- 1) (3 + 



n— 2 

2'" 



+ -+^^ 1:2: 3... ^ a+l+ ^ j+...| 

Now 

(1 + 2xY-' = \l + 2{n-s)x + 2^ i!i::iAtt^JJILll »2 4.... 

Multiply by x^, difierentiate both sides and divide by 2, finally putting x := I, and 
we find 



+ 2' ( y^ - g) (w - g - 1) ... (w - g - -^ + 1) i+..2 _|_ ^^^ 



Hence we deduce 



or 



m^ = 1 + g^^ + 3 (^^ — <!5^). 



64 PEOFESSOE K. PEAESOI^ ON A GENEEALISEl) THEOEY OF ALTEENATIVE 

But the mean of the whole population of offsprmg is at 1 + f 7^ from our origin. 
Thus we have the final results : 

Mean number of allogenic couplets in offspring of fathers with s allogenic couplets 

— ^ji — -|- (ii — s) allogenic couplets. 

Deviation from mean of general population of this array of oflCspring 



— l"'^'^ "™" 3 



{n ^ s) -'^^{4.s -- 7h). 



Deviation of fathers from mean of population 

. ^ ^ — |-7i = |- (4.9 — n). 

rill 

1 llUS 

Deviation of offspring from mean of population 
Deviation of fathers from mean of population 



3 



We have then the following results, which could certainly not have been fore- 
seen : — 

(a.) The regression is constant for all arrays, or the regression curve is a straight 
line. 

(b.) The slope of this straight line is ^^, or, since we liave seen that the population 
is stable, the parental correlation is -g- also. 

Now these results are of very singular importance, A very general theory of the 
pure gamete type leads to linearity of the regression curve, a result amply verified 
by observations on inheritance in populations f' and this result is quite independent 
of the number of couplets supposed to form the total character of the parent, or of 
the fact that in this case the arrays of offspring are skew and do not obey the normal 
law.t Further, the value of the correlation reached is numerically identical with the 
value obtained by Fkancis Galton in his original investigations on the inheritance 
of stature ! The generalised theory of the pure gamete is thus shown, whatever the 
number of couplets taken, to lead to precisely the chief results already obtained by those 
who have studied heredity statistically. So far then it might appear that a 
generalised theory of the pure gamete was capable of being brought into accordance 
with the chief results of biometric experience in heredity. This would undoubtedly 
be a great step forward, as linking up pei^ectly definite inheritance results with a 
physiological theory of heredity. Unfortunately the whole drift of recent biometric 
observations on heredity emphasises three points: 

Mr^^".-— That the parental correlation appears to be markedly greater than ^, nearei" 
to '45 to '5. 

* Galton, ' Natliral Inheritance,^ p. 96 ■ ^ JBiometrika,^ vol. 2, pp. 216 and 362™3, 
t This is further demonstration that linearity of regression has nothing whatever to do with the Gauss- 
Laplacian law of errors, i,e., normal curves or surfaces. 
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Secondly. — -That this correlation appears to vary slightly from charactei^ to 

character. 
Thirdly. --Th^t it does not appear to be absolutely the same for all species. 

It is most unfortunate for this general theory of the pure gamete, that it throws 
the Mendelian back into the position of the biometrician of 1885/^ One might have 
hoped that the generality involved in n couplets would have led to the requ.isite 
elasticity, or, failing this, to a numerical value of parental correlation nearer the 
cluster point of existing measurements than -J. We can only say, at present, that a 
generalised theory of the pure gamete leads to precisely the same general features of 
regression as have been observed by the biometricians, but it appears numerically too 
narrow to describe the observed facts. 



(7.) PnoPOSiTioisr Y.— To find the Standard Deviation of the Array of Offsjyring 

due to Fathers with s-allogenic Couplets. 

We have to find the standard deviation as of the combination of binomials dealt 
with in the previous proposition. Each component standard deviation must, of 
course, be weighted with the total frequency of the component, and there must be 
the proper reduction to the mean of the array as a whole. 

The {i + 1)^^^ binomial (|-ti + -J^-' )''""* ^^^^ ^ ('^^ "~ ^) i ><^ i"t for its standard deviation, 
and the distance of its mean from the mean of the array 

= b'^ + i - (^ + 1 + i (^ -» i))} - {^n — I {n - s)} 
— i (n-s)- ii 

Further, the frequency of this component is 

— P Qi -f 

We thus see that it contributes 
to the total second moment about the mean of the array. This gives us 

2 



A X 3«- X cr/ =/iSc„_,,„2' \^-^-^^'^ + 



n — s I 



\ 3 2/ 

■^ " Galton's law makes the amount of inheritance an ahsolnte constant for each pair of relativeB. It 
would thus appear not to be a character of race or species, or one capable of modification by natural 
selection." More ample statistical experience of populations since 1885 shows that ahsokde constancy of 
the heredity coefficients is not consonant with actual measurements.—' Roy. Soc, Proc.,' vol. 62, p. 411, 

t *Phil. Trans./ A, vol. 185, p. 373. 
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or 



d o"/ ^- ia^™. i . i i -7 + - --- /^ ^ --V — ^ ^ ^ - 



i^ow 



A.™.,i,0-' ,^ . g . . ^ 



( X "j" ZiJUj ~*« ^C,j^.„g I Q \/jnAj} 



o J 



and by differentiating 



,' rvA — 1 



Repeating the process 

4 (ti — 5) {n »^ 5 ~ 1) {1 + 2;ry^^^°^^ ^ tc,,^s, .:, 2^'^' (i — 1) ^^'""l 

Hence, putting a^ = 1, we have the required expressions on the right of the above 
result, or 



Therefore 



o-/ = In - I (n "^ 5) = 3^ {5n + 45). 



Now the standard deviation of the whole population, as far as allogenic units are 
concerned, is 



Thus 



\/-if (5 + 4 ^ . 



This result is of singular interest. The variability of an array of offspring 
corresponding to a father of given allogenic constitution is not independent of the 
father^ but increases steadily from a minimum of \/~i-^n, when there are no allogenic 
couplets in the father, to a maximum of \/\n when there are only allogenic couplets. 
In other words, fixing our attention on the same character, let the offspring of the 
hybrids inter se be crossed first with one pure race, and then with the second pure 
race, ^.e., first with pure allogenic and then with pure protogenic individuals, there 
ought to be a marked difference in the variability of the resulting offspring in the 

two cases. 

Corollary (i.).-"-In the theory of linear regression as apart from the theory of 
normal correlation on the basis of the Gauss-Laplacian distribution,'^' if 0- be the 
standard deviation of any character, and r its correlation with a second character, 

o* \/l — r^ 
^ Yule, ^Journal of Royal Statistical Society,' yoL 60, December, 1897, 
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is the mean standard deviation of all the arrays of the first character for a given value 
of the second. This expression is no longer the actual standard deviation of each 
array. 

It is of interest to see that this general law of linear regression is verified in the 

present case. We have a = ^/yqII and r = |-. ECence if 'tm be the mean standard 
deviation of the arrays, v/e should expect 

^9. . 3._^ /I .^ i\ — Lm 



Bemembering the weight of each array,^' we have 



bn + 45 



4.. ^ 4.. ^ 4. ^ ^^^ ^ 'g' ^ 4.. X 4X, .,o3«^^ gg 



8—0 



= 4^^ X ^' (1 + 3)'^ -i^n + 4^' X 4^4^*^! f^ 
= 4:'' X 4^^ X 4^^ X in, 

whence ^n? = i^, as we anticipated. 

Corollary (ii.). — It is clear that some arrays of oflPspring will be more, others less 
variable than the general population. The standard deviation of an array will be 
equal to that of the general population when s is found from 

3V (5n + is) = i%n, or s — i^n. 

Now the mean number of allogenic couplets in the general population is \n. Thus 
the offspring array equally variable with the general population is at distance -f^n 
from the mean. But a for the general population = s/tq"^* Hence, if we take 
fathers deviating from the population mean by \/~i-i-n X a*, we should expect their 
offspring to be equally variable with the general population. Supposing, therefore, 
the theory under discussion were true, we should have a means of finding, at least 
approximately, the number n of couplets corresponding to the character under 
consideration. All we should have to do would be to find the standard deviation of 
each array of offspring corresponding to a given father ; these standard deviations 
ought to increase or decrease steadily across the table, their squares giving a straight 
line when plotted. Smooth the results, interpolate a value equal to cr, and we shall 
have the character of the father whose ofispring are equally variable with the general 
population ; but the deviation of this father from the mean ought to be \/xQn X <f. 
Hence, since cr can be found, we have at once an approximate value of fk Approxi- 
mate only, of course, because our arrays are classed by units of measurement, inches^ 

^ The number of offspring in the array due to the fathers with 5-allogenic couplets is found at once 
by putting u-^'V = wm the formula of Corollary (i.) on p, 61, and equals 4^ x 4^^ c^j,, s) o3^ 



O 
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centimetres, &C.5 and each such unit will not, as a rule, represent one allogenic 
couplet ; but interpolation ought to give a result not widely divergent from the 
truth. 

The method would of course fail practically if ti were very large. For example, if 
n were 48, the deviation of the required group of fathers would be ocr, and hence 
such a father would only occur once in 1000 individuals. In a manageable population, 
therefore, we are very unlikely to have enough such fathers to form any reliable 
measure of the variability of their offspring. At the same time, the squares of the 
vai-iabilities of the arrays of sons due to quite frequent fathers ought to give a 
straight line, and if this line be determined properly, there should be no difficulty in 
finding the theoretical position of the above father, and so finding 11. 

Many other physiological theories besides the present might give this peculiarity 
of the diminishing variability of the arrays of offspring as we pass from, one side of 
the correlation table to the other. Such changes in variability are familiar to those 
who have had to deal with skew correlation. But, as far as we are aware, they have 
not hitherto been noticed in inheritance tables. The existence of this changing 
Variability would not affect in any way the general theory of linear regression apphed 
to heredity in populations. It would, however, lead to an innnediate extension of 
that theory consisting in the tabling of the standard deviations of the arrays. 
Should the standard deviations of these arrays show no bias towards a linear 
distribution, but only the fluctuation to be expected from random sampling about 
the mean value <r ^/ {I -^ t'^), we should have a strong argument against the present 
general theory of alternative inheritance. We seem liere, therefore, to have a crucial 
test of the validity of the theory, which may be quite as easy to apply as the previous 
test of the numerical value of the parental correlation. 

Of course the results now reached are not consistent theoretically with normal 
Correlation surfaces with their elliptic contour lines. The fact that Mr. Galton came 
to his elliptic contours in the first instance on the basis of his observations, and not 
from any theory,^^ shows that they must in the case he was dealing with be approxi- 
mately correct. Further, there is no doubt that in other statistics for characters in 
man there is within the limits of random sampling a close approximation to normal 
distribution. It might be hard to consider that such a deviation as would arise with 
a continuously increasing variability of the arrays from one side to the other of the 
table could exist and escape notice, had we not in physics had evidence that theory 
has often led to the discovery of an obvious relation which time after time must have 
been overlooked by previous observers unprovided with the theoretical hint of what 
to seek for. Hence while we may say that the parental dorrdation gi^en by thel 
theory is too rigid for the facts, we must leave this second test until more careful 
examination ad hoc has been made of the ample existing data^^ 



•7^ i 



Natural Inheritance/ p. lOl, 
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(8.) PnoPOSiTiON VI.— -To find the Array of Ojf spring due to a Grandfather of 
s-allogenic Couplets, siip)p)osing Complete Random Mating in the Population. 

The general distribution of the population is 

The fathers with ^-allogenic units in their correlation are given by 



On^.^o^l' {2V + '^^^^y 



i — s 



The array of oilspring due to these fathers is simply obtained by writing U for u^ 
V for t\ and W for tv, and multiplying by 4^' X 4''. This is a general rule for getting 
the offspring from any father if he mates at random. It gives us, as on p. 61, for the 
offspring distribution 

4'^ X 4^'^ X c,,,,.,/u^{2V + wy^ 



'il-"S 



4" X 4" X c„,,„„ (i« + ivy {(!« + I'') + {v + w)] 



n—s 



To get the offspring of this array treated as fathers and mating at random, we 
have only to repeat the process, and we find 

Otlspring of grandfather of ^'-allogenic couplets 

:= 4" X 4" X 4"- X 4"c,„,„ (fU 4- fV)' {^U + fV -f W]"- '^ 



4*'c,„,, „ (|tt + J-y + iioy (|« + ^v + iw) 



71 -S 



^"'■c, s, (f « + i^y (|« + fe + 1 {'0 + iv)f-'^ X (|)"-^ 



where 



e 2^ -It? -4" "5 ^^ 



5 



and is equal to unity if we identify v and to as something not allogenic. This can be 
dealt with exactly as in Proposition IT. we dealt with the array of offspring due to 
a father of 5-allogenic couplets, i.e., by analysing the array into the sum of a number 
of weighted binomials ; in this case all skew* 

Writing as before, ?^ = |^ (i; + w), we have to expand 

(l^4-|e)*(i« + |e + !>?)"-'. 

Thi general term is 

This haa a total frequency 0,,^^, i, o {iY X f^ and its mean is at a distance ^+ 1 + K'-^— ^^ 
from the origin which is taken at {71 -j^ 1) allogenic couplets. 
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The total freqiieiicy of the array is (1 + "l)"""'/i- Hence, if 'ni[^. be the mean of the 
grandchildren measured from the same origin ^ we have 



f, X (ly^"^ X < -=y; {1 + in + l {n ^ s) {2 + 1 {n ^^^ 1)} 

+ (I) ^-- -I « q \- - ~-~^- ('^' + 1 + |(rt"-i)} 



-/; {(> + 1) X (I-)"-" + u ('* - ^) (1)""^"} 



or 



m\ = 1 + in + |;-|;|- (n — 5) == 1 + in + -|- (n — 5). 



Thus 

Mean of grandchildren :::^ in — -| (n — s). 

Deviation from general population m^ean ^=- j^n — -j^ (n — .9) ^^ 2^4: (45 — n). 

Deviation of grandparent from, general population mean ^=^ ^ — -In = ^ (4^ — n). 



Deviation of offspring* 
Deviation of grandparent 



6^ 



This ratio is the same whatever be the allogenic constitution of the grandparent, 



(9.) Proposition VII. — To find the Array of OJf scoring chte to an m*^ Great-grand- 
father of s- allogenic Couplets, supposing Complete Random Mating in each 
Generation. 

The array due to a father of s- allogenic couplets is 



4^^ X ^" X c,,^,,o {i ^^ + v) Y li (^-^ + "^0 + (t^ + lo)} 



11— § 



and, as we have already seen, we must multiply by 4'' X 4'' and put ^ {n + v) for u^ 
^{u -^ 2v -\- iv) for V, and K^^ + ^c;) for tv to get the array due to the grandparent of 
s allogenic units. This process must be repeated m times if we vdsh to obtain the 
array due to the m^'^^ great-grandparent. 

We must first investigate what happens to | (u H- i;) if this interchange be made ni 
times. Suppose that it has been done i times, and let the answer be 

M,i(^ + t;) + MVH^ + H 
Repeat the operation, and the expression becomes 

(I M, + iM',) i{u + v)+ (l-M, + |M',) -I {v + tv), 
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or 



M,^, = |M, + iM',- , 



' -m, + m', . 



Therefore 
Hence 









^ , ^ i — i /]VF- — -l-'^ ::::::: ^ ^ (W. ^ Pi := ^ ^ 



Hence, finally^ 






Thus the result of m changes on ^ (u + -y) is known„ 
Similarly the result of m changes on |- (^^ + w) is 

We can now write down the array of offspring due to an m"' great-grandparent of 
5-allogenic couplets. It is 



\yr u + v , -J,., V + w 



S 



(4''X ^''Tcn,,JM.,„--^y-\-l^ 



X |(M,„ + 2M',„) ."-+ " + (M'„ + 2M,„) 'i -| '" 
We must now find the mean of this array. For brevity let us write 



n —■ s 



m 



^ (it 4- v) + M^^ ^ {v + iv) = [M + ^^ 

(M. + 2M^.) ^^4-^ + (M^. + 2M.0 ^^^^ - M.„ + 2M^^^ (^^ + Xe + vril 



2 ' ' '' ' ^^^^ 2 M 

where 



;;« 



/ 






»- }?i 



^ 2 (M„2 ~ WJ) ^ 2 (M„ - M'„) 
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Hence we have to find the mean of the system 

(fjili + key (fiu -4" Xe "4- vriy^""^'\ 
The t*^' comj3onent binomial of this sum of binomials is 

It therefore has its mean at a distance 

from {71 +1) allogenic couplets, and a frequency given by 

The total frequency of the whole array = (1 + ^y^'^^fy 

Hence, taking moments round the origin at n + 1 allogenic couplets, we have, if 
m\ be the mean of the array, 

(1 + vy^'^^fi X ^r/,, ^=- J'\ \ 1 + kn + p {'71 — s) (2 + ^ ('^^ ^ 1)) 
+ ^ i:^-.^^ '^L- 1 -J:) (3 + x{n ~~ 2)) + ... 

+ .'■ ('^ ->•) (" ~ s - 1),.. (^ ™ s - ^ + 1) (! + ,• + x{n - i)) + .. 

Summing and dividing by (1 + v)'"'~^ we find 

Hence the mean, number of allogenic couplets in the members of the array 

r=: r?. + 1 — m^s = "ii (1 — X) — {n -»-" s) — - (l — X). 

1 »~p ^ 

Deviation of oflPspring from mean of general population 

F ( 1 -— X) [ 1 1 ™ X 

= s— ™" n if -^ '— - > • 

1 + z^ L ' 1 + V j 

We now substitute for v and X in terms of M;;^ and M^^^, and find 



4 -|"" . "j, ""■ 4 6 \-L "T ^^^' m) — "-- '-J2"" - --- j_2 --^ 
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Hence: Deviation of ofispiing — y V " irm ' ^^^^^ ^^^^ deviation of m^^^ great-grand- 
parent = S -— ^71. 

Thus we have 

Deviation of offspring __ i 1 

Deviation of ni}^^ great-grandparent ^ 2"' 
This result is independent of ,s* and of n. 

Thus we conchide : 

(i.) The regression of offspring on any individual ancestor is linear ; 
(ii.) The correlation coefficient is halved at each stage in ancestry; 
(iii.) The result is perfectly independent of the number of couplets introduced into 
the forinrda. 

The first two results are very familiar to biometric workers in heredity. 

The actual numerical values of the grandparental, great-grandparental, great-great- 
grandparental correlations are -^, f^, 7^4? ^^• 

These are distinctly less than the values so far reached for ancestral correlation, 
the grandparental correlations, for instance, lying betAveen *2 and 'o. 

The results show, however, that a general theory of the pure gamete, embracing 
the simpler forms of the Mendelian principle, leads us directly to a series of ancestral 
correlations decreasing in a geometrical progression. Thus, when we suppose a 
population arising from hybridisation to cross at random, we find that it obeys the 
second fundamental assumption of the biometric theory of heredity. "^^ In other words, 
ancestry is of the utmost importance, and the population, follows laws identical in 
form with those propounded in the biometrical theory on the basis of a linear 
regression, multiple correlation. Only the values of the constants deduced for the 
law of ancestral heredity from the present theory of the pin*e gamete (Avhich appears 
to cover the bulk of Mendelian formulaa hitherto propounded) are sensibly too small 
to satisfy the best recent observations on inheritance., 

It is of interest to find ''Mendelian Principles" when given a wide analytical 
expression leading up to the very laws of linear regression, of distribution of 
frequency, and of ancestral inheritance in populations, which have been called into 
question as exhibiting only a blurred and confused picture of what actually takes place. 

It would be an immense advantage if we could accept such a theory of the pure 
gamete as has been here analysed as a physiological basis for the theory of heredity. 
We should then have a physiological origin for the ideas of regression and of ancestral 
inheritance which statistics of heredity in populations have made familiar to 
biometric workers. Unfortunately, even such a general pure gamete theory as we 
have here dealt with, while leading to results which form a special case of the law of 
ancestral heredity, is not sufficiently elastic to cover the observed facts. l\he lesson 

^ ' Biometrika,' vol. 2, p. 220. 
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to be learnt from the present investigation is, however, that there is no essential 
repugnance between any of the main results of the biometric school and a theory of 
the pure gamete, but, on the contrary, it is perfectly possible to test such theories by 
biometric methods. We may fairly ask anyone who propounds in future a Mendehan 
or pure gamete fornmla as a general theory of heredity, to remember that it involves 
in itself definite laws regulating the reproduction of a population mating at random, 
and that it is incumbent on the propounder to test whether or not such laws are 
consistent with what we already know of the inheritance statistics of such popula- 
tions. When we remember that deducing all the effects of such a formula within the 
whole field of inheritance will almost always form a very laborious piece of 
mathematical analysis, there seems a touch of scientific irresponsibility in propounding 
an immense variety of fornmlse to suit one or other special case, and the modifying or 
withdrawing them when they are found to fail in another. 



(10.) Proposition YIH-—T0 find the Regression and Correlation of Brethren 

on the Theo7'ii developed, in this Fa/per. 

We shall suppose the group of brethren to coiisist of 4;)^ members, or any pair of 
parents to have a family of 4;^. 

Consider first parents of one couplet only, the offspring of the 1 6 possible pairs are 
ifiven in the table below :— 







Father. 






a + a. 
2x{u-vv) 


a + Ae 

2xOi + ?') 
X {}i + 2-?; + -z/;) 
X {it + 2v 'f V)) 

^xij^'Vtv) 


1 
A + ^^. A + A. 

1 


Xh 


"" a + a 

a ~j- A 
A + ft 
^A + A 


2x(^* + '?0 
X {U' + 2v + '^6') 
X {u + 2y + 'w) 

2x {^o + v) 


2x (^ + v') 

2x {w + v) 

ix^o 



Now let us take every pair of brothers out of each of these 16 families and form a 
correlation table of brothers. The following table gives the distribution of the 
various types :— 



- 




Fh'st brother. 




■IL 


i\ 


'W, 


^ ^ '7/. 

'^ / Tj 

•^ 1 

?-i .0/. 


4x (9x -- 4) 
24x''^ 


24x= 
80x^ - 32x 

24x-2 


4x^ 

24x^ 

4x (9x - 4) 
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This gives a total of 256x^—6ix= 1^ X ^X ('^^X""" P^^ii'^ ^^ brothers, as it should, 
every brother in 16 families having ^x — - 1 bretliren, and there being 4^ in each 
family. Clearly we can divide by 4x, and we have the simplified form :— 





Ul, 


%. 


Wl. 


9x-4 
6x 

X 


6x 

20x-8 

6x 


X 

6x 

9x-4 



where the subscripts 1 and 2 refer to the first and second brothers. 

Let us simplify this by considering only the allogenic elements, rj denoting either a 
heterogenic or a protogenic element. Then we have :— 





Uh 


^1- 


W2 


9x-4 
7x 


i 

7x 
41x-12 



Thus the distribution of pairs of brothers in the case of the character being fixed 
by a single couplet is 

(9x - 4) u^u^ + 7x^m + ^X^hVi + (41x - 12) o^^tj^. 

This is to be read as follows : there are 9;)( — 4 cases of both brothers being 
allogenic, to 41^ — • 12 cases of neither brother allogenic, to 7x + ^X cases of one only 
of the two brothers being: allogenic. 

Now when we pass from a character fixed by one couplet to a character fixed by 
two, the above distribution can occur in either couplet, and e^ery possible pair of 
brothers will be got by squaring the above expression. Proceeding in this way to 
n couplet characters, we have the following symbolic expression for the distribution 
of brothers 

(4x)' X {9x - 4) w,% + rx%% + 7x%^i + (41x - 12)7?^^^}% 

the omitted factor 4x being restored. 

This, I think, represents the distribution of a character measured by allogenic 
couplets in a population of pairs of brothers, i.e.^ is the correlation table for brothers 
in the population—any term involving u/tt^'^ representing when we put t^j and ij^ 
equal to unity the number of pairs in which the first brother has p and the second q 
allogenic couplets in their constitutions. 

Hence if we find the coefficient of uf in terms of t%, we shall have the array oi 

L 2 
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brothers to be ibuiul associated with a brother of p allogenic couplets. The above 
expression may be written 

M" UPx - 4) u^ + 7x7/1 \ III + {7x% + (41x - 12) i?i} ^J'^ 

The term involving iif is 

(%y' {(^X — ^) % + ^X'^iP' {7X% + (4"lx ^ 12) ri^Y'^^'r^:^^"^^,^.^,^,, 
Neglecting the constant factor, the distribution in ?io is given by 



n~p 



{(9x - 4) u^ + 7x^i}^^^ {Ix^h + (41x - 12) r^J' 

We require to find the mean of tliis array. 

Put 

9y — 4 7y 4 (5v — 3 



16x — 4 16x -" 4 7x 

Then again, but for a factor independent of the power of u^, the array may be 
read 

The general term is therefore 



(XM3 + /^7;,)'"+'(i'>?,)""''''' 



''■-'-,? „.^j. gj 5 



and we must sum from 5 = o, to s =:= n — p. 

The general term has therefore its mean, at the distance 1 + /x (p + -^9) from 
p -f- s + 1 allogenic couplets, or its mean 

= (1 — ju.) (2^ + .s'), allogenic couplets. 



l — //, S, 



and its total frequency -^ p^^-p-^^ 

This gives a total frequency of the array proportional to (1 + p)''""^', 
Hencej taking moments, we have for the mean m of the array given by 



.S" = 'il — ■/) 



m X (1 + vf'-^" = S {{1 ^ fi){p + s) p^^-^-^-'e,,^^^ ,. ,} 



= (1 "^^ jw.) p (1 + !;)'''"■' ^^ + (1 "^ ju.) (r^ ■-" p) (1 + vf 
Therefore 

1 '— UL , P (1 — ^ a) 

Hence we see that 

(i.) The regression between brothers is linear. 

(ii,) The fraternal correlation which is equal to the regression = 

(5x "^ 3) 
tj ( 4i'y "'*-" 1 ) 
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and is quite iiidepeiideiit of the number of couplets, it is, liowever, a fuiictioii of i(. 
tbe size of the family used in forming the table. We have the following values : ~- 



Size of family. 


Value o1 


8 




X = 2 


12 




3 


16 




4 


20 




5 


24 




6 


32 




8 


40 




10 


CO 




00 









Value of fraternal 
correlation. 



•3333 

•3636 

•3778 
•3860 
•3913 
•3978 
•4017 
• 4067 



The value of fraternal correlation thus varies with the size of fche family dealt with 
from *3 to '4, Probably the more correct way of looking at any fraternal correlation 
table would be to suppose it a random sample of all the pairs of brothers which would 
be obtained by giving a large, or even indefinitely large, fertility to each pair, for 
what we actually do is to take families of varying size and. take as many pairs of 
brethren as they provide. In this case we ought to reach a fraternal correlation of '4 
precisely the value reached by the ancestral law when we take B'rancis Galton's 
original series of ancestral correlations.^^ 

Thus we conclude that on the general theory of the pure gamete here dealt with, 
the fraternal correlation is slightly larger than the parental. This is in accordance 
with the general result of biometric investigations on populations. But the value, as 
in the case of the parental correlation, is very sensibly lower than the value — about 
*5 — "found from recent investigations on man. It is further very inelastic even if we 
allow for some variation, in the size of families dealt with. There can be little doubt 
that fraternal correlation varies from character to character and species to species in 
a manner sensibly beyond what can be accounted for by differences in the size of th 



s 



e 



family dealt with.t 

Corollary. — We can exhibit the regression in the form : 

Mean of array — mean of general population 



p 



V I {deidation of brother from mean of general population} 



by observing that 1 — /x = -^(1 + ^i^)? whence 



wi 



, (1 + ^m) ! 5^(1 — m) 



* 'Roy. Soc, Proe./ vol. 62, p. 410. 

t There is sensible variation even for different characters, when we take the same series of pairs of 
brothers, and only one pair from each family. 
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01" 



m - ^11 = \ ' ' i'P ~" ill). 

(IL) Proposition l^.-—To find the General Formula for Biparental Regresdon on 
the Theory of the Pure Gamete^ and the Value to he given to tJie " Mid-parent^ 

If we applied without further consideration the general formula for biparental 
regression to this case, we should have^ if m.pg be the mean of the ofispring due to 
lathers of p-allogenic couplets, mated with mothers of ^/-allogenic couplets, 



ra.p,j =r -^n + i- ( p — 7|-/i) + -^ (q — -^n). 



This follows at once, since the mean of the general population '=:^ |?i, the regression 
coefficient for either parent = |, and there is no assortative mating. 
Hence we should have 

Now suppose both parents of pure allogenic race, then p ==: q r- r?, and all the 
offspring will be of pure allogenic race, or we must have uipr^ = n. 
But the above formula o:ives 

3 

n^jq '— %'n^ 
which is not correct. 

In other words, while the above formula gives the best plane to fit the array of 
points determined by the parental constitutions, that array of points does not trxdy 
lie in a plane. Or, although the simple regressions are linear, the co]X).pound 
regression is not truly |)lanar« We have therefore to find its true form, and measure 
the amount of deviation from the truth involved in using a biparental formula of the 
type indicated. 

Given a character resulting from u couplets, we require 4'' X 4'' individuals, A" male 
and 4:" female, to form the whole possible system of random matings. In such a 
population there would be 

^y^;j,o3'''~'' fathers of p-allogenic couplets, 
and 

^4,r/,o3''"""''? mothers of (/-allogenic couplets. 

The chance therefore of a mating of a p-allogenio father and a g-allogenic mother is 



(\p,o ('n,q, 







p"-p-^/4^\ 



and when n be even moderately large, this gets very small if p and q at all nearly 
approach u. For example, if n were 5, and father and mother were both pure 
allogenes, the chance of such a ptu?e allogenic mating would be only 

1/1,048,576, 

or in a population of a million would hardly occur once. 
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Still keeping n -= 5, take p = q 



4 



pi zzz 4. Then we have the chance of such a 



matnisf 



1/4215 



still extremely improbable. 

Thus when n is even moderately large, pure allogenic matings are so rare that they 
have vanishingiy small influence in the population, at large. Even if n were 2, the 
cliance of a pure allogenic mating is only -^l-^. These points nnist be borne in mind 
in what follows. 

Consider first a father and a motliei- of one couplet each, their zygotes are either u, 
V or it\ involving a gametic constitution a, + a, a + A or A + -A* We have the 
following scheme : - -• 



Zygote of fcit}i(3r. 


Zygote of mother. 


Niimbei' 


of ma tings. 


Offspring, 




1/, 


II, 






4:U 




\ >' 


1' 




2 


2 {u + v) 




11 


w 






4-/1' 




r 


II: 




2 


2 {v + -ii) 




\ '' 


'' 




4 


u-{-2n + w 







w 




2 


2 {W + /:) 




j w 


II 






2 {V + /;) 




to 


1) 




2 


2 (to + •/•) 




w 


w 




1 


4:W 





Hence ii there be 

1 allogenic couplet in father and 1 in mother. 



spruig 



. . 








Let us write 







All, 

4 {ii + 2v), 
4 (2'V + It), 
4 (if. + 4t' "I" 



iiv). 



i«/o - 16 iiu + V + w), 16/, =^ 16 {{u + ^^;)^ 



^^J2^ 16T?/. 



Tlien consider the relation 

where e and rj are mere symbols, and 0, 1, etc., denote their powers, u, ?/ refer 
respectively to father and mother, and. their powers denote the number of allogenic 
couplets in the zygotes of father and mother. Then the above is a symbolical relation 
which gives, by equating any power or product of e and rj on either side, the offspring 
of a pair of parents of definite constitution. 

Now suppose the parents not to consist of a single couplet, but of //. couplets, then 
the total distribution, of oifspring that we have given above for any couplet may occur 
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in eacli couplet, and each such distribution must be conibined with every other couplet 
distribution. We tlien I'each, dropping unnecessary indices, tiie general symbolic 



+ V'''€" 



rei.atioir 



^ 4'^^ X 4'^ X (io +ji (^ + v) + h^vY^ 

Tlius tlie array of olispring due to parents of zygotes witli p aiid q allogenic 

couplets respectively— i,e,, to iiJ'X'if/''-- is the coefficient of e^'-^'^ on the riglrt-liand side, 

or in the expansion of 

4'^^ X 4'^ X U, +i] (e + v) +J:i^vy' 

This may be written 

4" X 4« X U,+j^l}e+j,+J-^)"- 



Thus the coefficient of 



e''' :!S 



4«X 4"X {Ji+J~ir)}"ih+JiVr "<^n,„,o: 

e recjuire to pick the coeffi.cien.t of rf' oirt of this in ordej; to get the array ol 
olispring due to f^ithers of p^ and. to mothers of (/, alJogenic couplets. But this is 
clearly 

T J-1 ./i J{) O'-' 'Pi ^/', -r-ho 



^ Ji' ' J\ 



-f 






E- fi fl 



or, more briefly, 



4" X 4" X o„,,^A'B{jr-jr"""'j 



u-/i-r 










We shall iii'st fiud the mean and frequency coi'resjjonding to the t'-^' term as given 
above of this series« What we have to deal witli is 



(i^O'''^'' (i'^^ + i^)'^""''"' {:h^ + ^-^ + ^^)" 



-i>- 



We may write this 



gii-'Tl-r s^ ,^^_r-r 



4 



7b 



X (i« + Ix)"-^^^-^-'' X {}u + |i' + 2x')"'"^'"'- 



where x. ^ ''•' ^^^ X^ ^ 3^' ^" t^^'? ^^-^^I- both may he jjut unity when we are merely 
iinding the distribution of allogenic couplets. 

Now the general term in the above expression is 



[yn — {/ + r \y ,, .0 — r 



X i 



4 



/(- 



X {iu+ |.^) <'--"+*■+' (2x0" ■'' '-'c 



n—p— }', «, 05 
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and 5 must be taken from o to n — p — r. The iToquency of this term is 

An ^ ^n-p~r,s,o^ 

and its mean is q — r + h {p ^ 9. ^ 2f' + .9) allogenic couplets. 
The total frequency of the r*^^ term, is therefore 

^ (1 + 2)^^---^^"-^' = ^-^- , 

which of course must ultimately be multiplied by the factorials in r omitted above. 
If rrir be the mean number of allogenic couplets in the t*^' term, we have 

m,. X "-^^ - = S^ ( -' ^„ - 2''-^'-'-^a„_,,„,, ., „ {g - r + i ( p - g + 2r + s) \ 

= I5 _ t' + i ( p -- g + 2r)} ^-^ ^^^ ~^ 4„ 1 (t^ « p ^ r) ~- ^^^~.--~- 

Thus : 



= ¥^ + k + |p - 1'^- 

This is the mean of the r*^' term, and its total frequency is 



4' 






Hence, if 



F = 4'^'- X 3^2^''-/'""^ S c,^^,,^, X 0.-,^ 






'r=o 



4i>-^n.^p-q x/; 



we shall have 



T=^q 



fX m^-,y - S {{^n 4- |g + |p) - 1^"} c„,p,,. X c,,,,_,, „ , 



r = () 



where m^^j is the mean of the array of offspring due to fathers of p and mothers of g 
allogenic couplets. 

The only difficulty liere is summing the series 



*"^ i ' ^'-''h P, r -^ ^-'2), q-r, ) ° 



But this may be written 



or multipled by 4«""i32(«--i)-;>-(??-i)/.^2. it represents the total offspring of fathers ofp and 
VOL. coin. — A. M 



rji 



Chis leads us to 



Tihpq 
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mothers of § — I allogenic units in a population with [n — 1) couplets in their 
constitution 

Hence 

Now 
Ihus 

or J if y^ denote the above series, we have 

^ 1,, j^ 2^ ±. 2,. ^, 4 q{n—p) 

= Ipi 4" 9 (p + f/) + () ^ 

_ 1 (^'^ + 2^3) (91 + 2g) 

This is a most remarkable result, for it shows that the regression surface is not a 
plane but a hyperboloid. Let us measure all the quantities in deviations from the 
mean of the general population, ^.e., put m^^j = m^^ + 7f^^s p = p^ + i?^? 9 — (/ + 4'/^' 
We find 

m^prj = K j)/ 4. g^) + f P - / « 

This formula reconciles at once the Mendelian and Galtonian positions. When the 
number of couplets is large, parents having a number of allogenic couplets comparable 
with n are vanishingly small in number. The standard deviation a of the population 

is \/37i/4 (see p, 57). , 
Hence we may write 



\ 2 J 3(T V 3 



?i 



But jy can only once per thousand cases be as big as 3or, and accordingly when n is 
large, both terms of the product will be small In this case the surface becomes 
practically planar, and we have the Galtonian result of 1885,"^^ that the offspring from 
the Galtonian ^^midparent" are one-thbd nearer the general mean of the population. 
On the other hand, when 71 is small we see that for midparents not differing too 
widely from the population mean, Galtonian regression of the value ^ holds, but that 

* ' NaturaMnlieritance/ p. 97, 
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as we pass away from the mean of the population the regression gets less and less, 
becoming absolutely zero when we take two pure allogenic parents. Just as the 
regression is reduced when we emphasise the allogenic constitution of the parents, it 
is increased when we emphasise the non-allogenic elements. 

To illustrate these points I have drawn a diagram for n == 9 of the contour lines of 




^ Number of Allogenic Couplets in Father- 

Midparental contours. 

the regression surface as a plane and a hyperboloid. The area ABGD contains all 
possible parents; 999 out of 1000 random matings fall in the loop ahcd; 99 out of 
100 random mating within the loop (ilVdd!, which has been carried right round to 
mark that it excludes matings in which both parents have no allogenic couplets. The 

M 2 
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nieaus of the array of offspring due to any pair of parents are marked in slim figures 
along the Mendelian hyperbolic contours, and in heavy figures along the Galtonian 
straight lines. We see comparatively small differences as long as we deal with 
m.atings within the 99 per cent, loop, somewhat grea>ter difl:erences as we approach 
the 99*9 per cent, loop, and very marked differences as we go beyond the latter 
boundary towards pure allogeiiic parentage. A. study of the diagram illustrates at 
once how the Mendelian theory exhibits for the bulk of the population Galtonian 
regression, such regression becomiing, however, less and less as we proceed to 
individuals, the frequency of whose matings may be less than one in a million. 

It will be seen again that in this proposition we have no fundamental antagonism 
between the Mendelian and biometric standpoints. We reach a single formula which 
approaches more and more closely to the biometric standpoint wlien we deal with 
characteTS depending on niany allogenic couplets/"' On the other hand, it gives the 
absence of regression which is obtained when pure allogenic parents are mated. We 
see, however, quite clearljr that it is totally erroneous to argue from this single case 
against regression in general. Such regression actually exists on " Mendelian 
Principles " when any population breeding at random, is taken, and involves in itself 
the whole conception of ancestral correlation and the influence of ancestry. 

Of the formula for the midparent nov/ reached, however, we can only say that on 
the basis of our experience in populations the factor | seems too inelastic to work. 
Here again the data must be especially investigated from the standpoint of a 
midparent given by 

i{/ + ^0 + f*'^^' 



2F 



n 



before judgment can be final on this test. 

Theoretically, by assuming the midparent to be 

i(y + sO + x/?'^ 

we should have a means of finding ;v by averages, and therefore n, the number of 
couplets involved. 

( 1 2. ) General Conchisioiu^, 

In this paper we have dealt with a general theory of the pure gamete -possibly 
not the widest that could be conceived^ but sufficientlv wide to indicate the real 
beai'in0' of ¥Lend.elian fbrmuko when applied to a poDulation mating' at random. We 

c y ,1. .1. J. J. o 

see that under such circumstances : 

(i.) The ]:)opulation which results from tJie offspring of liybrids remains stable; 
every variation which appears, appears witli a certain definite and predicable 

"^- For 71 = 50 or 100 the population Avitliin tlie I in 1000 lirio sensibly obeys ordinary linear midparenta.l 



regression a 
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frequency ; there is no room for the appearance of '' mutations/' although certain 
variations with very small frequency would be extremely rare in a limited population. 
A mutation. — a variation not hitherto observed — would only appear in the ofispring 
of the hybrids between two pure races ; after this with random mating the mixed 
race would remain perfectly stable until disturbed by sexual or natural selection. 
These are the only mutations which arise on the generalised theory of the pure 
gamete, i.e., two pure races form one mixed race, breeding true to itself; it is difficult 
under these circumstances to account for the origin of the two pure races by a 
mutation-theory of the difierentiation of species ! 

(ii.) Between any two relations — if we measure the character by the number of 
allogenic or protogenic couplets in the zygote of the individual — we have a linear 
regression. The frequency distribution of any character is skew, approaching closely 
the normal distribution as the number of couplets which determines the constitution 
of the zygote is increased. 

(iii.) The correlations between pairs of blood relations take definite numerical 
values absolutely independent of the number of couplets, and the same for all 
characters and races. 

(iv.) The ancestral correlations form a geometrical series of common ratio one-half 

(v.) Fraternal correlation is fixed between narrow limits depending on the number 
of brothers per family dealt with, and is very slightly larger than parental 
correlation. 

( vi. ) The theory of the midparent for a considerable number of couplets approaches 
closely that originally given by Francis Galton, except for extreme values of the 
character, when the regression, becomes rapidly smaller and ultimately vanishes. 

We thus see that a generalised theory of the pure gamete would be of very great 
advantage if it could be accepted. It would lead to a system of inheritance in 
randomly mating populations with non-drfferential fertility, which in its broad 
features would be essentially the same as that which has been biometrically 
developed not from theoretical hypotheses, but from the statistical description of 
observed facts in populations. 

Unfortunately, however, when we come to the actual numerical values for the 
coefficients of heredity deducible from such a theory of the pure gamete, they do not 
accord with observation. They diverge in two ways : First, they give a rigid value 
for these coefficients for all races and characters — a result not in reasonable 
accordance with observation. Secondly, they give values distinctly too small, as 
compared with the average values, or with the modal values of large series of 
population observations. 

We thus reach the point we have so often had to insist upon : that the biometric 
or statistical theory of heredity does not involve a denial of any physiological theory 
of heredity, but it serves in itself to confirm or refute such a theory. Mendelian 
formulaB analytically developed for randomly mating populations are either consistent 
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or not with the biometric observations on such, oopuiations. If they are consistent, it 

-1,- J. ■ty -^ 

shows their possibihty, but does not prove their necessity. If they are not, it shows 
they are inadequate. The present investigation shov/s that in the theory of the pure 
gamete there is nothing in essential opposition to the broad features of linear 
regression, skew distribution, the geometric law of ancestral correlation, etc, of the 
biometric description of inheritance in popuJ.ations. But it does show that the 
generalised theory here dealt with is not elastic enou^li to account for the numerical 
vahies of the constants of lieredity hitherto observed. 

It will be time enoup-h to consider other more or less general Mendelian formu.JiB 
when there is far better evidence than exists at present that they cover a real range 
of observation, and have not been solely invented to describe isolated exijeriences, the 
numerical results of which are not in complete accordance v/ith simple Mendelianism. 
Given such neo-Mendelian formulae, there is a perfectly straightforwT3.rd mathematical 
method of applying them to randomly mating po])ulations, but that method is 
excessively laborious, and the biometrician may well hesitate to undertake the task 
of their investigation. A few minutes suftice to invent a Mendelian formula, but 
weeks of labour mav be involved in testing' whether it leads to lep'itimate results 

^ CD O 

when applied to sexually crossing races. Let us therefore have a few simple general 
principles stated which embrace all the facts deducible from the hybridisation 
experiments of the Mendelians ; these can form, the l-)asis of a new mathematical 
investigation, but it is idle to undertake such a.n investigation so long as Mendelian 
Principles remain in a state of flux, 

.Any combination of the theory of pure gametes here discussed with homogamy, or 
with fertility correlated wuth homiogamy, or again vuth prepotency of individual or of 
type, would emphasise the correlations which we have found above to be too low^ ; 
but such hypotheses would involve a fundamental alteration in the formula 

{a + a') (A + A/) == aK + a.A' + (/K^ + AA^ 

Such a formula would then give the possibilities of the cross, but the proportions of 
these possibilities actually occurring would be quite different."" 

Such loading of the possibilities — not only of the individual coup.l.et- -but very 
probably of associated couplets in. the constitution- -might conceivably enable us to 
deduce better values for the ancestral and collateral, correlations. But it would 
abolish not only the simplicity of the f\nidamental Mendelian formida, it would also 
involve lengthy preliminary studies on homogamy, fertility, and prepotency before 
any effective formula could be propoimded. 



-^ ''! 



Toss two pennies, and the result of in tossings will closely approximate to the distribution 
7i(IiH + 2HT + TT). Load one or both coins, and the possible variations v/ill still be HH, HT or TT, but 
their proportions wiJl ])e far from n : 2n : n. 
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